CoDraw: Visual Dialog for Collaborative Drawing

نویسندگان

  • Jin-Hwa Kim
  • Devi Parikh
  • Dhruv Batra
  • Byoung-Tak Zhang
  • Yuandong Tian
چکیده

In this work, we propose a goal-driven collaborative task that contains vision, language, and action in a virtual environment as its core components. Specifically, we develop a collaborative ‘Image Drawing’ game between two agents, called CoDraw. Our game is grounded in a virtual world that contains movable clip art objects. Two players, Teller and Drawer, are involved. The Teller sees an abstract scene containing multiple clip arts in a semantically meaningful configuration, while the Drawer tries to reconstruct the scene on an empty canvas using available clip arts. The two players communicate via two-way communication using natural language. We collect the CoDraw dataset of ∼10K dialogs consisting of 138K messages exchanged between a Teller and a Drawer from Amazon Mechanical Turk (AMT). We analyze our dataset and present three models to model the players’ behaviors, including an attention model to describe and draw multiple clip arts at each round. The attention models are quantitatively compared to the other models to show how the conventional approaches work for this new task. We also present qualitative visualizations.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Graphical Constraints in CoDraw

Constraint based draw programs require users to understand and manage relationships between drawing elements. By establishing constraint relationships among elements the user effectively programs the drawing's behavior. This programming task requires a more sophisticated visual interface than conventional draw programs provide. Users must have available — in a convenient format — information ab...

متن کامل

End Users Creating More Effective Software

End-User Software is created using a variety of different techniques and paradigms. The “creating” part is defined as the process of representing the desired program in a computer-understandable form, and entering that representation into the computer. Programs can be represented using textual languages, visual (also called graphical) languages, spreadsheets (which are often included as a type ...

متن کامل

Goal-oriented Dialog as a Collaborative Subordinated Activity involving Collective Acceptance

Modeling dialog as a collaborative activity consists notably in specifying the content of the Conversational Common Ground and the kind of social mental state involved. In previous work (Saget, 2006), we claim that Collective Acceptance is the proper social attitude for modeling Conversational Common Ground in the particular case of goal-oriented dialog. We provide a formalization of Collective...

متن کامل

An Investigation into the Relationship between Dialog and Narrative Elements of the Holy Quran from a Literary Perspective

After more than 14 centuries, the expressive inimitability of the holy Quran opens new dimensions for thinkers every day. One of these angles is the study of the story and its relation to the element of dialog. The purpose of this article is to look into this issue in the verses of the holy Quran. For this purpose, the methods and techniques of storytelling in the Quran, the characteristics of ...

متن کامل

Towards Situated Collaboration

We outline a set of key challenges for dialog management in physically situated interactive systems, and propose a core shift in perspective that places spoken dialog in the context of the larger collaborative challenge of managing parallel, coordinated actions in the open

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1712.05558  شماره 

صفحات  -

تاریخ انتشار 2017